arXiv:0909.1801vl  [cs.IT]  9  Sep  2009 


i 


Randomized  Sensor  Selection 
in  Sequential  Hypothesis  Testing 
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Abstract — We  consider  the  problem  of  sensor  selection  for 
time-optimal  detection  of  a  hypothesis.  We  consider  a  group  of 
sensors  transmitting  their  observations  to  a  fusion  center.  The 
fusion  center  considers  the  output  of  only  one  randomly  chosen 
sensor  at  the  time,  and  performs  a  sequential  hypothesis  test.  We 
consider  the  class  of  sequential  tests  which  are  easy  to  implement, 
asymptotically  optimal,  and  computationally  amenable.  For  three 
distinct  performance  metrics,  we  show  that,  for  a  generic  set  of 
sensors  and  binary  hypothesis,  the  fusion  center  needs  to  consider 
at  most  two  sensors.  We  also  show  that  for  the  case  of  multiple 
hypothesis,  the  optimal  policy  needs  at  most  as  many  sensors  to 
be  observed  as  the  number  of  underlying  hypotheses. 

Index  Terms — Sensor  selection,  decision  making,  SPRT, 
MSPRT,  sequential  hypothesis  testing,  linear-fractional  program¬ 
ming. 

I.  INTRODUCTION 

In  today’s  information-rich  world,  different  sources  are 
best  informers  about  different  topics.  If  the  topic  under  con¬ 
sideration  is  well  known  beforehand,  then  one  chooses  the 
best  source.  Otherwise,  it  is  not  obvious  what  source  or 
how  many  sources  one  should  observe.  This  need  to  identify 
sensors  (information  sources)  to  be  observed  in  decision 
making  problems  is  found  in  many  common  situations,  e.g., 
when  deciding  which  news  channel  to  follow.  When  a  person 
decides  what  information  source  to  follow,  she  relies  in  general 
upon  her  experience,  i.e.,  one  knows  through  experience  what 
combination  of  news  channels  to  follow. 

In  engineering  applications,  a  reliable  decision  on  the  un¬ 
derlying  hypothesis  is  made  through  repeated  measurements. 
Given  infinitely  many  observations,  decision  making  can  be 
performed  accurately.  Given  a  cost  associated  to  each  obser¬ 
vation,  a  well-known  tradeoff  arises  between  accuracy  and 
number  of  iterations.  Various  sequential  hypothesis  tests  have 
been  proposed  to  detect  the  underlying  hypothesis  within  a 
given  degree  of  accuracy.  There  exist  two  different  classes 
of  sequential  tests.  The  first  class  includes  sequential  tests 
developed  from  the  dynamic  programming  point  of  view. 
These  tests  are  optimal  and,  in  general,  difficult  to  imple¬ 
ment  0.  The  second  class  consists  of  easily-implementable 
and  asymptotically-optimal  sequential  tests;  a  widely-studied 
example  is  the  Sequential  Probability  Ratio  Test  (SPRT) 
for  binary  hypothesis  testing  and  its  extension,  the  Multi¬ 
hypothesis  Sequential  Probability  Ratio  Test  (MSPRT). 
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In  this  paper,  we  consider  the  problem  of  quickest  decision 
making  using  sequential  probability  ratio  tests.  Recent  ad¬ 
vances  in  cognitive  psychology  m  show  that  the  performance 
of  a  human  performing  decision  making  tasks,  such  as  ’’two- 
alternative  forced  choice  tasks,”  is  well  modeled  by  a  drift 
diffusion  process,  i.e.,  the  continuous-time  version  of  SPRT. 
Roughly  speaking,  modeling  decision  making  as  an  SPRT 
process  is  somehow  appropriate  even  for  situations  in  which 
a  human  is  making  the  decision. 

Sequential  hypothesis  testing  and  quickest  detection  prob¬ 
lems  have  been  vastly  studied  lH71.  ||4j.  The  SPRT  for  binary 
decision  making  was  introduced  by  Wald  in  ED,  and  was 
extended  by  Armitage  to  multiple  hypothesis  testing  in  HD. 
The  Armitage  test,  unlike  the  SPRT,  is  not  necessarily  op¬ 
timal  (24).  Various  other  tests  for  multiple  hypothesis  test¬ 
ing  have  been  developed  throughout  the  years;  a  survey  is 
presented  in  ED-  Designing  hypothesis  tests,  i.e.,  choosing 
thresholds  to  decide  within  a  given  expected  number  of 
iterations,  through  any  of  the  procedures  in  lfj~8l  is  infeasible  as 
none  of  them  provides  any  results  on  the  expected  sample  size. 
A  sequential  test  for  multiple  hypothesis  testing  was  developed 
in  0,  ED,  and  ED,  which  provides  with  an  asymptotic 
expression  for  the  expected  sample  size.  This  sequential  test  is 
called  the  MSPRT  and  reduces  to  the  SPRT  in  case  of  binary 
hypothesis. 

Recent  years  have  witnessed  a  significant  interest  in  the 
problem  of  sensor  selection  for  optimal  detection  and  estima¬ 
tion.  Tay  et  al  ED  discuss  the  problem  of  censoring  sensors 
for  decentralized  binary  detection.  They  assess  the  quality  of 
sensor  data  by  the  Neyman-Pearson  and  a  Bayesian  binary 
hypothesis  test  and  decide  on  which  sensors  should  transmit 
their  observation  at  that  time  instant.  Gupta  et  al  ED  focus  on 
stochastic  sensor  selection  and  minimize  the  error  covariance 
of  a  process  estimation  problem.  Isler  et  al  ED  propose 
geometric  sensor  selection  schemes  for  error  minimization 
in  target  detection.  Debouk  et  al  0  formulate  a  Markovian 
decision  problem  to  ascertain  some  property  in  a  dynamical 
system,  and  choose  sensors  to  minimize  the  associated  cost. 
Wang  et  al  [l22l  design  entropy-based  sensor  selection  algo¬ 
rithms  for  target  localization.  Joshi  et  al  ED  present  a  convex 
optimization-based  heuristic  to  select  multiple  sensors  for 
optimal  parameter  estimation.  Bajovic  et  al  0  discuss  sensor 
selection  problems  for  Neyman-Pearson  binary  hypothesis 
testing  in  wireless  sensor  networks. 

A  third  and  last  set  of  references  related  to  this  paper 
are  those  on  linear-fractional  programming.  Various  iterative 
and  cumbersome  algorithms  have  been  proposed  to  optimize 
linear-fractional  functions  0,  0-  In  particular,  for  the  prob¬ 
lem  of  minimizing  the  sum  and  the  maximum  of  linear- 
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fractional  functionals,  some  efficient  iterative  algorithms  have 
been  proposed,  including  the  algorithms  by  Falk  et  al  G2  and 
by  Benson  (61- 

In  this  paper,  we  analyze  the  problem  of  time-optimal  se¬ 
quential  decision  making  in  the  presence  of  multiple  switching 
sensors  and  determine  a  sensor  selection  strategy  to  achieve 
the  same.  We  consider  a  sensor  network  where  all  sensors  are 
connected  to  a  fusion  center.  The  fusion  center,  at  each  instant, 
receives  information  from  only  one  sensor.  Such  a  situation 
arises  when  we  have  interfering  sensors  (e.g.,  sonar  sensors), 
a  fusion  center  with  limited  attention  or  information  processing 
capabilities,  or  sensors  with  shared  communication  resources. 
The  fusion  center  implements  a  sequential  hypothesis  test 
with  the  gathered  information.  We  consider  two  such  tests, 
namely,  the  SPRT  and  the  MSPRT  for  binary  and  multiple 
hypothesis,  respectively.  First,  we  develop  a  version  of  the 
SPRT  and  the  MSPRT  where  the  sensor  is  randomly  switched 
at  each  iteration,  and  determine  the  expected  time  that  these 
tests  require  to  obtain  a  decision  within  a  given  degree  of 
accuracy.  Second,  we  identify  the  set  of  sensors  that  minimize 
the  expected  decision  time.  We  consider  three  different  cost 
functions,  namely,  the  conditioned  decision  time,  the  worst 
case  decision  time,  and  the  average  decision  time.  We  show 
that  the  expected  decision  time,  conditioned  on  a  given  hypoth¬ 
esis,  using  these  sequential  tests  is  a  linear-fractional  function 
defined  on  the  probability  simplex.  We  exploit  the  special 
structure  of  our  domain  (probability  simplex),  and  the  fact 
that  our  data  is  positive  to  tackle  the  problem  of  the  sum  and 
the  maximum  of  linear-fractional  functionals  analytically.  Our 
approach  provides  insights  into  the  behavior  of  these  functions. 
The  major  contributions  of  this  paper  are: 

i)  We  develop  a  version  of  the  SPRT  and  the  MSPRT 
where  the  sensor  is  selected  randomly  at  each  obser¬ 
vation. 

ii)  We  determine  the  asymptotic  expressions  for  the  thresh¬ 
olds  and  the  expected  sample  size  for  these  sequential 
tests. 

iii)  We  incorporate  the  processing  time  of  the  sensors  into 
these  models  to  determine  the  expected  decision  time. 

iv)  We  show  that,  to  minimize  the  conditioned  expected 
decision  time,  the  optimal  policy  requires  only  one 
sensor  to  be  observed. 

v)  We  show  that,  for  a  generic  set  of  sensors  and  M 
underlying  hypotheses,  the  optimal  average  decision 
time  policy  requires  the  fusion  center  to  consider  at  most 
M  sensors. 

vi)  For  the  binary  hypothesis  case,  we  identify  the  optimal 
set  of  sensors  in  the  worst  case  and  the  average  decision 
time  minimization  problems.  Moreover,  we  determine  an 
optimal  probability  distribution  for  the  sensor  selection. 

vii)  In  the  worst  case  and  the  average  decision  time  mini¬ 
mization  problems,  we  encounter  the  problem  of  min¬ 
imization  of  sum  and  maximum  of  linear-fractional 
functionals.  We  treat  these  problems  analytically,  and 
provide  insight  into  their  optimal  solutions. 

The  remainder  of  the  paper  is  organized  in  following  way.  In 
Section  El  we  present  the  problem  setup.  Some  preliminaries 


are  presented  in  Section  EH  We  develop  the  switching-sensor 
version  of  the  SPRT  and  the  MSPRT  procedures  in  Section  HVl 
In  Section  [V]  we  formulate  the  optimization  problems  for 
time-optimal  sensor  selection,  and  determine  their  solution. 
We  elucidate  the  results  obtained  through  numerical  examples 
in  Section  [VT]  Our  concluding  remarks  are  in  Section  I  VII  I 

II.  PROBLEM  SETUP 

We  consider  a  group  of  n  >  1  agents  (e.g.,  robots,  sensors, 
or  cameras),  which  take  measurements  and  transmit  them  to  a 
fusion  center.  We  genetically  call  these  agents  ’’sensors.”  We 
identify  the  fusion  center  with  a  person  supervising  the  agents, 
and  call  it  ’’the  supervisor.” 


Fig.  1.  The  agents  A  transmit  their  observation  to  the  supervisor  S,  one  at 
the  time.  The  supervisor  performs  a  sequential  hypothesis  test  to  decide  on 
the  underlying  hypothesis. 

The  goal  of  the  supervisor  is  to  decide,  based  on  the 
measurements  it  receives,  which  of  the  M  >  2  alternative 
hypotheses  or  “states  of  nature,”  H k,  k  £  {0, . . . ,  M  —  1}  is 
correct.  For  doing  so,  the  supervisor  uses  sequential  hypothesis 
tests,  which  we  briefly  review  in  the  next  section. 

We  assume  that  only  one  sensor  can  transmit  to  the  supervi¬ 
sor  at  each  (discrete)  time  instant.  Equivalently,  the  supervisor 
can  process  data  from  only  one  of  the  n  agents  at  each  time. 
Thus,  at  each  time,  the  supervisor  must  decide  which  sensor 
should  transmit  its  measurement.  We  are  interested  in  finding 
the  optimal  sensor(s),  which  must  be  observed  in  order  to 
minimize  the  decision  time. 

We  model  the  setup  in  the  following  way: 

i)  Let  si  £  {1, . . . ,  n}  indicate  which  sensor  transmits  its 
measurement  at  time  instant  l  £  N. 

ii)  Conditioned  on  the  hypothesis  Hk,  k  £  {0, . . . ,  M  —  1}, 
the  probability  that  the  measurement  at  sensor  s  is  y,  is 
denoted  by  fg(y). 

iii)  The  prior  probability  of  the  hypothesis  H k,  k  £ 
{0, . . . ,  M  —  1},  being  correct  is  irk. 

iv)  The  measurement  of  sensor  s  at  time  l  is  ySl .  We  assume 
that,  conditioned  on  hypothesis  Hk,  ySl  is  independent 

of  y(sT),  for  (l,si)  ±  ( l,sT ). 

v)  The  time  it  takes  for  sensor  s  to  transmit  its  measurement 
(or  for  the  supervisor  to  process  it)  is  Ts  >  0. 

vi)  The  supervisor  chooses  a  sensor  randomly  at  each  time 
instant;  the  probability  to  choose  sensor  s  is  stationary 
and  given  by  qs. 

vii)  The  supervisor  uses  the  data  collected  to  execute  a 
sequential  hypothesis  test  with  the  desired  probability  of 
incorrect  decision,  conditioned  on  hypothesis  Hk,  given 
by  ak. 
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viii)  We  assume  that  there  are  no  two  sensors  with  identical 
conditioned  probability  distribution  fk(y)  and  process¬ 
ing  time  Ts.  If  there  are  such  sensors,  we  club  them 
together  in  a  single  node,  and  distribute  the  probability 
assigned  to  that  node  equally  among  them. 


III.  PRELIMINARIES 


A.  Linear-fractional  function 

Given  parameters  A  £  Rqxp,  B  el5,  c  £  Kp,  and  d  £  R, 
the  function  g  :  {z  £  Rp  |  cTz  +  d  >  0}  — >  R9,  defined  by 

.  .  Ax +  B 

9\A)  — x  :  ji 

c1  x  +  a 

is  called  a  linear-fractional  function  ED-  A  linear-fractional 
function  is  quasi-convex  as  well  as  quasi-concave.  In  partic¬ 
ular,  if  q  =  1,  then  any  scalar  linear-fractional  function  g 
satisfies 


g{tx  +  (1  -  t)y)  <  max{s(a:),  g(y)}, 
g{tx  +  (1  -  t)y)  >  min {g(x),g(y)}, 


Given  the  probability  of  false  alarm  F(H\\Ho)  =  cto  and 
probability  of  missed  detection  F(Hq\Hi)  =  ai,  the  Wald’s 
thresholds  770  and  771  are  defined  by 

Vo  =  log  ,  Ql  ,  and  771  =  log  - — — .  (3) 

1  —  ao  ao 

The  expected  sample  size  N,  for  decision  using  SPRT  is 
asymptotically  given  by 


E  [N\H0] 
E  [N\Hi] 


(1  -  010)770  +  00771  -  A0 

W/1) 

oi?7o  +  (1  -  01)771  -  A0 

V{f\n 


and 


(4) 


as  —770,  Vi  — >  00.  The  asymptotic  expected  sample  size  expres¬ 
sions  in  equation  (0}  are  valid  for  large  thresholds.  The  use  of 
these  asymptotic  expressions  as  approximate  expected  sample 
size  is  a  standard  approximation  in  the  information  theory  lit¬ 
erature,  and  is  known  as  Wald’s  approximation  a,  am  ns. 

For  given  error  probabilities,  the  SPRT  is  the  optimal  se¬ 
quential  binary  hypothesis  test,  if  the  sample  size  is  considered 
as  the  cost  function  ED- 


for  all  t  £  [0, 1]  and  x,  y  €  {z  €  Rp  |  cT z  +  d  >  0}. 


B.  Kullback-Leibler  distance 


Given  two  probability  distributions  functions  /1  :  R  — >  R>o 
and  fi  :  R  — >  R>o,  the  Kullback-Leibler  distance  V  :  C1  x 
Cl  — >  R  is  defined  by 


V{fuf2)  =  F 


log 


h{X) 
h{X)  J 


=  f  h(x)logjj^dx. 
Jr  /2O) 


Further,  2?(/i,/2)  >  0,  and  the  equality  holds  if  and  only  if 
fi  =  fi  almost  everywhere. 


C.  Sequential  Probability  Ratio  Test 

The  SPRT  is  a  sequential  binary  hypothesis  test  that  pro¬ 
vides  us  with  two  thresholds  to  decide  on  some  hypothesis, 
opposed  to  classical  hypothesis  tests,  where  we  have  a  single 
threshold.  Consider  two  hypothesis  Hq  and  If ,  with  prior 
probabilities  7To  and  it\,  respectively.  Given  their  conditional 
probability  distribution  functions  f{y\Hf)  =  f°(y)  and 
f(y\Hi)  =  /1(y),  and  repeated  measurements  { J/i ,  2/2 
with  Ao  defined  by 

Ao  =  log  — ,  (2) 

7T0 

the  SPRT  provides  us  with  two  constants  770  and  771  to  decide 
on  a  hypothesis  at  each  time  instant  l,  in  the  following  way: 

i)  Compute  the  log  likelihood  ratio:  A;  :=  log 

N 

ii)  Integrate  evidence  up  to  time  N,  i.e..  An  :=  ]C  A/, 

;= 0 

iii)  Decide  only  if  a  threshold  is  crossed,  i.e., 

{An  >  771,  say  TTi, 

An  <  go,  say  H0, 

An  £  ]?7o,  771  [,  continue  sampling. 


D.  Multi-hypothesis  Sequential  Probability  Ratio  Test 

The  MSPRT  for  multiple  hypothesis  testing  was  introduced 
in  0,  and  was  further  generalized  in  ED  and  lUTl.  It  is 
described  as  follows.  Given  M  hypotheses  with  their  prior 
probabilities  ttk,k  £  {0, . . . ,  M  —  1},  the  posterior  probability 
after  N  observations  yi,l  £  N}  is  given  by 


N  , 

ttk  n  fk(yi) 

pkN  =  F(H  =  Hk\yi,  •  •  -,yN)  =  M-1  l=\ - 

E  Tfi  (  n 

3= 1  V=1 

where  fk  is  the  probability  density  function  of  the  observation 
of  the  sensor,  conditioned  on  hypothesis  k. 

Before  we  state  the  MSPRT,  for  a  given  N,  we  define  k  by 

N 

k=  argmax  TtjHfJ(yi). 
je{o,...,M-i}  *-1 

The  MSPRT  at  each  sampling  iteration  l  is  defined  as 

Pi  >  i+Vk , f°r  at  ieast  one  say  Hk, 

otherwise,  continue  sampling. 


where  the  thresholds  77 &,  for  given  frequentist  error  probabili¬ 
ties  (accept  a  given  hypothesis  wrongly)  otk,  k  £  {0 , ,M  — 
1},  are  given  by 

V  k  =  - ,  (5) 


'Kk^ik 


where  7 k  £  ]0, 1[  is  a  constant  function  of  fk  (see  ||5l). 

It  can  be  shown  0  that  the  expected  sample  size  of  the 
MSPRT,  conditioned  on  a  hypothesis,  satisfies 


E[N\Hk] 


-  log  V k 

&k 


as  max  rjk  — *  0+, 


where  Sk  =  min {T>(fk,  f^)  \  j  £  {0, . . . ,  M  -  1}  \  {fc}}. 

The  MSPRT  is  an  easily-implementable  hypothesis  test  and 
is  shown  to  be  asymptotically  optimal  in  0,  m. 
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IV.  Sequential  hypothesis  tests  with  switching 

SENSORS 


A.  SPRT  with  switching  sensors 

Consider  the  case  when  the  fusion  center  collects  data  from 
n  sensors.  At  each  iteration  the  fusion  center  looks  at  one 
sensor  chosen  randomly  with  probability  qs,  s  £  {[....  ,n  \ . 
The  fusion  center  performs  SPRT  with  the  collected  data. 
We  define  this  procedure  as  SPRT  with  switching  sensors.  If 
we  assume  that  sensor  si  is  observed  at  iteration  l,  and  the 
observed  value  is  ySl,  then  SPRT  with  switching  sensors  is 
described  as  following,  with  the  thresholds  rjo  and  771  defined 
in  equation  (0,  and  Ao  defined  in  equation  0: 

i)  Compute  log  likelihood  ratio: 


A 1  :=  log 


fstiVsi) 


N 

ii)  Integrate  evidence  up  to  time  N,  i.e.,  An  :=  E  A/, 

;=o 

iii)  Decide  only  if  a  threshold  is  crossed,  i.e.. 


!An  >  771,  say  Hi, 

A  tv  <  170,  say  H0, 

An  £  ]77o,t?i[,  continue  sampling. 


Lemma  1  (Expected  sample  size):  For  the  SPRT  with 
switching  sensors  described  above,  the  expected  sample  size 
conditioned  on  a  hypothesis  is  asymptotically  given  by: 

E[N\H0}^-{1~ano)r,0  +  aom~X°,  and 

1  X  (6> 

Qh 7?o  +  (1  -  <*1)771  -  A0 

n  ’ 

E  QsV(f},fS) 

S=1 

as  -770,771  ->  00. 

Proof:  Similar  to  the  proof  of  Theorem  3.2  in  |23I.  ■ 

The  expected  sample  size  converges  to  the  values  in  equa¬ 
tion  ©  for  large  thresholds.  From  equation  0,  it  follows 
that  large  thresholds  correspond  to  small  error  probabilities. 
In  the  remainder  of  the  paper,  we  assume  that  the  error 
probabilities  are  chosen  small  enough,  so  that  the  above 
asymptotic  expression  for  sample  size  is  close  to  the  actual 
expected  sample  size. 

Lemma  2  (Expected  decision  time):  Given  the  processing 
time  of  the  sensors  Ts,  s  £  {1, . . . ,  n},  the  expected  decision 
time  of  the  SPRT  with  switching  sensors  Td,  conditioned  on 
the  hypothesis  Hk,  k  £  {0, 1},  is 

n 

E  QsTs 

E[ Td\Hk\  =  - =  S—  for  each  fcG  {0,1},  (7) 

Zdsi!)  q  1 

s=  1 


where  T,  Ik  E  R>0,  are  constant  vectors  for  each  k  £  {0, 1}. 

Proof:  The  decision  time  using  SPRT  with  switching 
sensors  is  the  sum  of  sensor’s  processing  time  at  each  iter¬ 
ation.  We  observe  that  the  number  of  iterations  in  SPRT  and 


the  processing  time  of  sensors  are  independent.  Hence,  the 
expected  value  of  the  decision  time  Td  is 

E[Td\Hk]=E[N\Hk}E[T],  for  each  k  £  {0, 1}.  (8) 

By  the  definition  of  expected  value, 

n 

E{T]=J2<1sTs.  (9) 

S—  1 

From  equations  ©,  ([8]),  and  ©  it  follows  that 

n 

E  QsTs 

E  [Td\Hk\  =  £E - =  for  each  k  £  {O’  !}i 

E  ^  q  1 

s= 1 

where  I £  £  R>o  is  a  constant,  for  each  k  £  {0, 1},  and  s  £ 

{1,...,tz}.  ■ 

B.  MSPRT  with  switching  sensors 

We  call  the  MSPRT  with  the  data  collected  from  n  sensors 
while  observing  only  one  sensor  at  a  time  as  the  MSPRT  with 
switching  sensors.  The  one  sensor  to  be  observed  at  each  time 
is  determined  through  a  randomized  policy,  and  the  probability 
of  choosing  sensor  s  is  stationary  and  given  by  qs.  Assume 
that  the  sensor  si  £  {l,...,n}  is  chosen  at  time  instant 
l,  and  the  prior  probabilities  of  the  hypothesis  are  given  by 
7 rfe,  k  £  {0, . . . ,  M  —  1},  then  the  posterior  probability  after 
N  observations  yi,l  £  {1, ...  ,N}  is  given  by 

N 

.  ’ffc.g.  fs^Vl) 

pkN  =F(Hk\yi,...,yN)  =  — - |  N  - 

E  A?  \.uJsi(yi) 

j=o  V=1 

Before  we  state  the  MSPRT  with  switching  sensors,  for  a 
given  N,  we  define  k  by 

N 

k=  argmax  nk  II  f*  (yi). 
fce{o,...,M— 1}  ‘-1 

For  the  thresholds  k  £  {0, . . . ,  M  —  1},  defined  in 
equation  ©,  the  MSPRT  with  switching  sensors  at  each 
sampling  iteration  N  is  defined  by 

Pn  >  f°r  at  least  0ne  say  Alk, 

otherwise,  continue  sampling. 

Before  we  state  the  results  on  asymptotic  sample  size  and 
expected  decision  time,  we  introduce  the  following  notation. 
For  a  given  hypothesis  Hk,  and  a  sensor  s,  we  define  jJ  ^ 
by 

3{a,k)=  argmin 

We  also  define  £x>  :  An_i  x  ( C1)'1  x  (/l1)”  — ►  K  by 

n 

3=  1 

where  An_i  represents  the  probability  simplex  in  W1. 
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Lemma  3  (Expected  sample  size):  Given  thresholds  rjj,  j  £ 
{0, ...  ,M  —  1},  the  sample  size  N  required  for  decision 
satisfies 

E  [N\Hk\  1 


-log  Vk 

m  -►  0. 


£v{q, 


as  max 
fe{o . m-  1}  ' 

Proof:  The  proof  follows  from  Theorem  5.1  of  J3]  and 
the  observation  that 


lim 

N  — >oo  N 1 


=  X>w,‘,  id- 


1  =  1 


8  =  1 


Lemma  4  (Expected  decision  time):  Given  the  processing 
time  of  the  sensors  Ts,s  £  {l,...,n},  the  expected  deci¬ 
sion  time  Td  conditioned  on  the  hypothesis  Ht,  for  each 
k  €  {0, . . . ,  M  —  1},  is  given  by 


E[Td\Hk\ 


log  Vk  y-  T  =  q-T 
£v(qJk,P*)frCs  3  q-~Ik ’ 


(10) 


where  T,  Ik  £  M”0  are  constants. 

Proof:  Similar  to  the  proof  of  Lemma  [2]  ■ 


V.  OPTIMAL  SENSOR  SELECTION 

In  this  section  we  consider  sensor  selection  problems  with 
the  aim  to  minimize  the  expected  decision  time  of  a  sequential 
hypothesis  test  with  switching  sensors.  As  exemplified  in 
Lemma  01  the  problem  features  multiple  conditioned  decision 
times  and,  therefore,  multiple  distinct  cost  functions  are  of 
interest.  In  Scenario  I  below,  we  aim  to  minimize  the  decision 
time  conditioned  upon  one  specific  hypothesis  being  true;  in 
Scenarios  II  and  III  we  will  consider  worst-case  and  average 
decision  times.  In  all  three  scenarios  the  decision  variables 
take  values  in  the  probability  simplex. 

Minimizing  decision  time  conditioned  upon  a  specific  hy¬ 
pothesis  may  be  of  interest  when  fast  reaction  is  required 
in  response  to  the  specific  hypothesis  being  indeed  true.  For 
example,  in  change  detection  problems  one  aims  to  quickly 
detect  a  change  in  a  stochastic  process;  the  CUSUM  algorithm 
(also  referred  to  as  Page’s  test)  M  is  widely  used  in  such 
problems.  It  is  known  m  that,  with  fixed  threshold,  the 
CUSUM  algorithm  for  quickest  change  detection  is  equivalent 
to  an  SPRT  on  the  observations  taken  after  the  change  has 
occurred.  We  consider  the  minimization  problem  for  a  single 
conditioned  decision  time  in  Scenario  I  below  and  we  show 
that,  in  this  case,  observing  the  best  sensor  each  time  is  the 
optimal  strategy. 

In  general,  no  specific  hypothesis  might  play  a  special  role 
in  the  problem  and,  therefore,  it  is  of  interest  to  simultane¬ 
ously  minimize  multiple  decision  times  over  the  probability 
simplex.  This  is  a  multi-objective  optimization  problem,  and 
may  have  Pareto-optimal  solutions.  We  tackle  this  problem 
by  constructing  a  single  aggregate  objective  function.  In  the 
binary  hypothesis  case,  we  construct  two  single  aggregate 
objective  functions  as  the  maximum  and  the  average  of  the  two 
conditioned  decision  times.  These  two  functions  are  discussed 
in  Scenario  II  and  Scenario  III  respectively.  In  the  multiple 


hypothesis  setting,  we  consider  the  single  aggregate  objective 
function  constructed  as  the  average  of  the  conditioned  decision 
times.  An  analytical  treatment  of  this  function  for  M  >  2,  is 
difficult.  We  determine  the  optimal  number  of  sensors  to  be 
observed,  and  direct  the  interested  reader  to  some  iterative 
algorithms  to  solve  such  optimization  problems.  This  case  is 
also  considered  under  Scenario  III. 

Before  we  pose  the  problem  of  optimal  sensor  selection,  we 
introduce  the  following  notation.  We  denote  the  probability 
simplex  in  R™  by  An_i,  and  the  vertices  of  the  probability 
simplex  An_i  by  e^,  i  £  {1  We  refer  to  the  line 

joining  any  two  vertices  of  the  simplex  as  an  edge.  Finally, 
we  define  gk  :  A„_i  — >  R,  k  £  {0, . . . ,  M  —  1},  by 


a\q) 


q-T 
q  ■  Ik 


A.  Scenario  I  (Optimization  of  conditioned  decision  time): 

We  consider  the  case  when  the  supervisor  is  trying  to 
detect  a  particular  hypothesis,  irrespective  of  the  present  hy¬ 
pothesis.  The  corresponding  optimization  problem  for  a  fixed 
k  £  {0, ...  ,M  —  1}  is  posed  in  the  following  way: 

minimize  qk(q) 

w  (11) 

subject  to  q  £  A„_i. 

The  solution  to  this  minimization  problem  is  given  in  the 
following  theorem. 

Theorem  1  (Optimization  of  conditioned  decision  time): 
The  solution  to  the  minimization  problem  (fill)  is  q*  =  es* , 
where  s*  is  given  by 

*  .  Ts 

s  =  argmin  —  . 

and  the  minimum  objective  function  is 

EPS  |  Hk\  =  jr-  (12) 

±s* 

Proof:  We  notice  that  objective  function  is  a  linear- 
fractional  function.  In  the  following  argument,  we  show  that 
the  minima  occurs  at  one  of  the  vertices  of  the  simplex. 

We  first  notice  that  the  probability  simplex  is  the  convex 
hull  of  the  vertices,  i.e.,  any  point  q  in  the  probability  simplex 
can  be  written  as 

n  n 

q  =  ^ases,  =  1,  and  as  >  0. 

s= 1  s= 1 

We  invoke  equation  (|T]i,  and  observe  that  for  some  (3  £  [0, 1] 
and  for  any  s,r  £  {1 , . . .  ,n} 

gk(/3es  +  (1  -  (3)er)  >  min{/(es),  gk(er)},  (13) 

which  can  be  easily  generalized  to 

gk(q)  >  min  gk(es),  (14) 

;e{i,...,n} 

for  any  point  q  in  the  probability  simplex  A„_i.  Hence, 
minima  will  occur  at  one  of  the  vertices  e.,-,  where  s*  is 
given  by 

T 

s*  =  argmin  gk(es)  =  argmin 

s£{l,...,n}  s£{l,...,n} 
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B.  Scenario  II  (Optimization  of  the  worst  case  decision  time): 

For  the  binary  hypothesis  testing,  we  consider  the  multi¬ 
objective  optimization  problem  of  minimizing  both  decision 
times  simultaneously.  We  construct  single  aggregate  objective 
function  by  considering  the  maximum  of  the  two  objective 
functions.  This  turns  out  to  be  a  worst  case  analysis,  and  the 
optimization  problem  for  this  case  is  posed  in  the  following 
way: 


minimize  max  { <7° (<z) ,  g1^)}  , 
subject  to  q  £  A„_i. 


(15) 


Before  we  move  on  to  the  solution  of  above  minimization 
problem,  we  state  the  following  results. 

Lemma  5  ( Monotonicity  of  conditioned  decision  times): 
The  functions  gk,  k  £  {0, . . . ,  M  —  1}  are  monotone  on 
the  probability  simplex  A„_i,  in  the  sense  that  given  two 
points  qi,q2  £  An_i,  the  function  gk  is  monotonically 
non-increasing  or  monotonically  non-decreasing  along  the 
line  joining  qi  and  q2. 

Proof:  Consider  probability  vectors  q\,q2  £  A„_i .  Any 
point  q  on  line  joining  qi  and  q2  can  be  written  as  q(t)  = 
tqi  +  (1  —  t)q2,  t  £  ]0, 1[.  We  note  that  gk(q(t))  is  given  by: 


gkm) 


t(qi  -T)  +  (l-t)(q2  -T) 
t(qi  •  Ik )  +  (1  -  t)(q2  ■  Ik)' 


The  derivative  of  gk  along  the  line  joining  qi  and  q2  is  given 
by 


±gk(q(t))  =  (gk(qi)-gk(q2)) 

(qi  ■  Ik)(q2  ■  Ik) 

(t{qi  •  Ik)  +  (1  -  t)(q2  ■  lk))2' 

We  note  that  the  sign  of  the  derivative  of  gk  along  the  line 
joining  two  points  qi,q2  is  fixed  by  the  choice  of  <j\  and 
q2.  Hence,  the  function  gk  is  monotone  over  the  line  joining 
qi  and  q2.  Moreover,  note  that  if  gk(qi)  gk(q2 ),  then  gk 

is  strictly  monotone.  Otherwise,  gk  is  constant  over  the  line 
joining  qi  and  q2.  ■ 

Lemma  6  ( Location  of  min -max):  Define  g  :  A„_i  — >  R>o 
by  g  =  max{g°,  g1}.  A  minimum  of  g  lies  at  the  intersection 
of  the  graphs  of  g°  and  g1,  or  at  some  vertex  of  the  probability 
simplex  An_i. 

Proof:  Case  1:  The  graphs  of  g°  and  <f  do  not  intersect 
at  any  point  in  the  simplex  A„_i. 

In  this  case,  one  of  the  functions  g°  and  g[  is  an  upper 
bound  to  the  other  function  at  every  point  in  the  probability 
simplex  An_i.  Hence,  g  =  gk,  for  some  k  £  {0, 1},  at  every 
point  in  the  probability  simplex  An_i.  From  Theorem  Q]  we 
know  that  the  minima  of  gk  on  the  probability  simplex  An_i 
lie  at  some  vertex  of  the  probability  simplex  A„_i. 

Case  2:  The  graphs  of  g°  and  g1  intersect  at  a  set  Q  in  the 
probability  simplex  A„_i,  and  let  q  be  some  point  in  the  set 

Q. 

Suppose,  a  minimum  of  g  occurs  at  some  point  q*  £ 
relint(A„_i),  and  q*  f  Q,  where  relint(-)  denotes  the  relative 
interior.  With  out  loss  of  generality,  we  can  assume  that 


g°(q*)  >  g1(q*).  Also,  g°(q)  =  g1(q),  and  g°(q*)  <  g°(q) 
by  assumption. 

We  invoke  Lemma[5]  and  notice  that  g°  and  g1  can  intersect 
at  most  once  on  a  line.  Moreover,  we  note  that  g°(q*)  > 
g1(q*),  hence,  along  the  half-line  from  q  through  g *,  g°  >  g\ 
that  is,  g  =  g°.  Since  g°(q*)  <  g°{q),  g  is  decreasing  along 
this  half-line.  Hence,  g  should  achieve  its  minimum  at  the 
boundary  of  the  simplex  An_i,  which  contradicts  that  q*  is 
in  the  relative  interior  of  the  simplex  An_  i .  In  summary,  if 
a  minimum  of  g  lies  in  the  relative  interior  of  the  probability 
simplex  A„_i,  then  it  lies  at  the  intersection  of  the  graphs  of 
g°  and  g1. 

The  same  argument  can  be  applied  recursively  to  show  that 
if  a  minimum  lies  at  some  point  q'  on  the  boundary,  then 
either  .g°(qt)  =  g 1(^t)  or  the  minimum  lies  at  the  vertex.  ■ 

In  the  following  arguments,  let  Q  be  the  set  of  points  in  the 
simplex  An_i,  where  g°  =  g1,  that  is, 

Q  =  {q£  A„_x  |g-(/°-/1)=0}.  (16) 


Also  notice  that  the  set  Q  is  non  empty  if  and  only  if  1°  —  I1 
has  at  least  one  non-negative  and  one  non-positive  entry.  If  the 
set  Q  is  empty,  then  it  follows  from  Lemma[6]that  the  solution 
of  optimization  problem  in  equation  dT5l>  lies  at  some  vertex  of 
the  probability  simplex  An_i.  Now  we  consider  the  case  when 
Q  is  non  empty.  We  assume  that  the  sensors  have  been  re¬ 
ordered  such  that  the  entries  in  1°  —  1 1  are  in  ascending  order. 
We  further  assume  that,  for  1°  —  I1,  the  first  m  entries,  m  <  n, 
are  non  positive,  and  the  remaining  entries  are  positive. 

Lemma  7  (Intersection  polytope):  If  the  set  Q  defined  in 
equation  (IT6l)  is  non  empty,  then  the  polytope  generated  by 
the  points  in  the  set  Q  has  vertices  given  by: 


Q  =  {qsr  |  s  £  {1, . . . ,  to}  and  r  £  {m  +  1, . . . ,  n}}, 


where  for  each  i  £  {1, ...  ,n} 

(/°-/)) 


qlr  = 


i  -q\ 


:sr 
s  i 


v0, 


if  i  =  s, 
if  i  =  r, 

otherwise. 


(17) 


Proof:  Any  q  £  Q  satisfies  the  following  constraints 


n 


=  1)  9s  £  [0,1], 

S=1 

(18) 

n 

J2qs(i°s  -  it)  =  o, 

(19) 

s=  1 


Eliminating  qn,  using  equation  (IT8]>  and  equation  (IT9l).  we  get: 

”~1  (T°  -  71!  —  (T°  —  T1) 

J2l3sqs  =  1,  where  ft  =  1  ^  ^ (20) 

The  equation  (f20t  defines  a  hyperplane,  whose  extreme  points 
in  K}"1  are  given  by 

qsn  =  yy-e-s,  t  6  1}. 

Hs 

Note  that  for  s  £  {l,...,m},  qsn  £  Arl_1.  Hence,  these 
points  define  some  vertices  of  the  polytope  generated  by  points 
in  the  set  Q.  Also  note  that  the  other  vertices  of  the  polytope 
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can  be  determined  by  the  intersection  of  each  pair  of  lines 
through  qsn  and  qrn,  and  es  and  er,  for  s  £  {1, . . . ,  to},  and 
r  £  {to  + 1, . . . ,  n  —  1}.  In  particular,  these  vertices  are  given 
by  qsr  defined  in  equation  ifTTb. 

Hence,  all  the  vertices  of  the  polytopes  are  defined  by  qsr, 
s  £  {1, . . . ,  to},  r  £  {to  +  1, . . . ,  n}.  Therefore,  the  set  of 
vertices  of  the  polygon  generated  by  the  points  in  the  set  Q 
is  Q.  m 

Before  we  state  the  solution  to  the  optimization  problem 
we  define  the  following: 


,  *  _  •  (4°  -  Ir)Ts  -  (7°  -  ll)Tr 

(s  ,r  )  £  argmin  - - ,  and 


r£{m  +  l n} 

»€{1 . m} 


/  *  *  \  _ 

y  two-sensors  5  T  )  — 

We  also  define 


w  =  argmin  max 
(T  *  T  * 

/  *\  I  -Lw  -Lw 

^one-sensor  )  —  max  \  — q  , 

L  ^w* 


(#  -  I^)TS,  -  (7°.  -  I],)Tr 
71,7°„  -  I°.lK 


T  T 

-LW  -Lw 

To-’  7 r 

w  w 


and 


Theorem  2  (Worst  case  optimization):  For  the  optimization 
problem  (fl5l),  an  optimal  probability  vector  is  given  by: 

*  J  &v)*  ,  if  <7one-sensor (tU  )  li  fj[  wo-sensors  5^  )? 

\qS  r  ,  if  ffone-sensor(ui*)  >  fftwo-sensors (s*  ,  V*  )  , 

and  the  minimum  value  of  the  function  is  given  by: 

min  {3  one-sensor  iw  );  fltwo-sensors  (s  if  )}  ■ 

Proof:  We  invoke  Lemma  |6]  and  note  that  a  minimum 
should  lie  at  some  vertex  of  the  simplex  An_i,  or  at  some 
point  in  the  set  Q.  Note  that  g°  =  g1  on  the  set  Q,  hence  the 
problem  of  minimizing  max{r/°.  <)'' }  reduces  to  minimizing 
g°  on  the  set  Q.  From  Theorem  [I]  we  know  that  g°  achieves 
the  minima  at  some  extreme  point  of  the  feasible  region.  From 
Lemma  |7]  we  know  that  the  vertices  of  the  polytope  generated 
by  points  in  set  Q  are  given  by  set  Q.  We  further  note  that 
fltwo-SenSors(s,r)  and  ^one-sensor (tu)  are  the  value  of  objective 
function  at  the  points  in  the  set  Q  and  the  vertices  of  the 
probability  simplex  An_i  respectively,  which  completes  the 
proof.  ■ 


C.  Scenario  III  (Optimization  of  the  average  decision  time): 

For  the  multi-objective  optimization  problem  of  minimizing 
all  the  decision  times  simultaneously  on  the  simplex,  we  for¬ 
mulate  the  single  aggregate  objective  function  as  the  average 
of  these  decision  times.  The  resulting  optimization  problem, 
for  M  >  2,  is  posed  in  the  following  way: 

minimize  ^  (5°  (9)  +  ...  +  gM^{q)),  ^ 

subject  to  q  £  A„_i. 

In  the  following  discussion  we  assume  n  >  M,  unless 
otherwise  stated.  We  analyze  the  optimization  problem  in 
equation  (fTTI)  as  follows: 


Lemma  8  (Non-vanishing  Jacobian):  The  objective  func¬ 
tion  in  optimization  problem  in  equation  (f2Tb  has  no  critical 
point  on  An_i  if  the  vectors  T,  1°, . . . ,  JM_1  £  K"0  are 
linearly  independent. 

Proof:  The  lacobian  of  the  objective  function  in  the 
optimization  problem  in  equation  (fTTI)  is 


_LA 

Mdq 


M- 1 


1X9), 


where  T  =  —  \T  - 1 0  ...  - 1 M~1] 

M  1  J 

tj;  :  A„_i  — >  Rm+1  is  defined  by 


G  R"x(m+i), 


^(9) 


M—l 


E 

fe= 0 


1 


q-T 
[ql  °)2 


q-T 

(q-IM- 1)2 


T 


and 


For  n  >  M,  if  the  vectors  T,  7°, . . . , /M_1  are  linearly 
independent,  then  F  is  full  rank.  Further,  the  entries  of  f 
are  non-zero  on  the  probability  simplex  Ar,_1 .  Hence,  the 
Jacobian  does  not  vanish  anywhere  on  the  probability  simplex 
A„_i.  ■ 

Lemma  9  (Case  of  Independent  Information):  For  M  =  2, 
if  1°  and  1 1  are  linearly  independent,  and  T  =  aol0  +  01  /  ' , 
for  some  ao,ai  £  R,  then  the  following  statements  hold: 

i)  if  ao  and  a\  have  opposite  signs,  then  g°  +  g1  has  no 
critical  point  on  the  simplex  An_i,  and 

ii)  for  ao,  au  >  0,  g°+g 1  has  a  critical  point  on  the  simplex 

A„_i  if  and  only  if  there  exists  v  £  An_i  perpendicular 
to  the  vector  —  •fafl1. 

Proof:  We  notice  that  the  Jacobian  of  g°  +  g1  satisfies 

(g.7°)2(,./1)2|(9°+51) 

=  T((g./°)(9./1)2  +  (9./1)(g- J0)2)  (22) 

—  I°{q  •  T){q  ■  I1)2  —  71(g  •  T)(q  ■  I0)2. 


Substituting  T  =  ayl0  +  ai I1,  equation  (1221  becomes 


(9./0)2(9./1)2^(5°  +  51) 

=  (a0(q  ■  I0)2  -  ai(q  ■  I1)2)  ((q  ■  J1)/0  -  (q  ■  I0)/1)  . 
Since  7°,  and  71  are  linearly  independent,  we  have 

J^(9°  +  91)  =  0  a0{q  ■  I0)2  -  cn{q  ■  I1)2  =  0. 

Hence,  g°  +  g1  has  a  critical  point  on  the  simplex  Ara_i  if 
and  only  if 

aQ(q  ■  I0)2  =  ai(q  ■  I1)2 .  (23) 


Notice  that,  if  ao,  and  a\  have  opposite  signs,  then  equation 
m  can  not  be  satisfied  for  any  q  £  An_i,  and  hence,  g°  +  g1 
has  no  critical  point  on  the  simplex  Ara_i. 

If  ao,  ai  >  0,  then  equation  (l23l>  leads  to 

q  ■  (x/a^I0  -  i/afl1)  =  0. 

Therefore,  g°  -Lg1  has  a  critical  point  on  the  simplex  An_i  if 
and  only  if  there  exists  v  £  Ara_i  perpendicular  to  the  vector 
ifafl0  -  1/afI1.  ■ 


Lemma  10  (Optimal  number  of  sensors):  For  n  >  M,  if 
each  (M  +  1)  x  (M  +  1)  submatxix  of  the  matrix 

F  =  [  r  —7°  ...  — /M_1  1  £  ]g>nx(M+l) 


is  full  rank,  then  the  following  statements  hold: 

i)  every  solution  of  the  optimization  problem  (12Tb  lies  on 
the  probability  simplex  Am-i  C  An_i;  and 

ii)  every  time-optimal  policy  requires  at  most  M  sensors 
to  be  observed. 


Proof:  From  Lemma [8]  we  know  that  if  T,  7°, ... ,  7M_1 
are  linearly  independent,  then  the  Jacobian  of  the  objective 
function  in  equation  (PH  does  not  vanish  anywhere  on  the 
simplex  A„_i.  Hence,  a  minimum  lies  at  some  simplex  A„_2, 
which  is  the  boundary  of  the  simplex  A„_i.  Notice  that,  if  n  > 
M  and  the  condition  in  the  lemma  holds,  then  the  projections 
of  T,  7°, . . . ,  7M~1  on  the  simplex  A„_2  are  also  linearly 
independent,  and  the  argument  repeats.  Hence,  a  minimum 
lies  at  some  simplex  Am-i,  which  implies  that  optimal  policy 
requires  at  most  M  sensors  to  be  observed.  ■ 

Lemma  11  (Optimization  on  an  edge):  Given  two  vertices 
es  and  eT,  s  r,  of  the  probability  simplex  A„_i,  then  for 
the  objective  function  in  the  problem  (ITTb  with  M  =  2,  the 
following  statements  hold: 

i)  if  3°(es)  <  g°(er),  and  g1{es)  <  g1(er),  then  the 
minima,  along  the  edge  joining  es  and  er,  lies  at  es, 
and  optimal  value  is  given  by  ^(g°(es)  +  g1(es))',  and 

ii)  if  g°(es)  >  g°{er),  and  g1(es)  <  g1(er),  or  vice  versa, 
then  the  minima,  along  the  edge  joining  es  and  er,  lies 
at  the  point  q*  =  (1  —  t*)es  +  t*er,  where 


7ryrs7i  -  TrI]  -  7ryrr7°  -  TJQ 
n \/TrPa  -  Ts 70  -  70 yjT'I}  -  TrIl 


and  the  optimal  value  is  given  by 


i(sV)  +  ffV)) 

1  (  Irji  -  TrI]  lTrI°-TaI°\ 

2  7s07i  -  7071  +  V  ~  ini  ) 


Proof:  We  observe  from  Lemma  [5]  that  both  g°,  and  <f 
are  monotonically  non-increasing  or  non-decreasing  along  any 
line.  Hence,  if  g°(es)  <  g°(er),  and  gl(es )  <  g1{er),  then  the 
minima  should  lie  at  es.  This  concludes  the  proof  of  the  first 
statement.  We  now  establish  the  second  statement.  We  note 
that  any  point  on  the  line  segment  connecting  es  and  er  can 
be  written  as  q(t)  =  (1  —  t)es  +  ter.  The  value  of  g°  +  g1  at 
q  is 


9°m)+g\q(t)) 


(1  —  t)Ts  +  tTr  (1  —  t)Ts  +  tTr 


Differentiating  with  respect  to  t,  we  get 


g°\q{f))+g'\q{f)) 

I°sTr  -  Ts7° 

(70  +  f(70-7°))2 


T I71  _  T  1 1 

j- s  -L  r  s±r 


(24) 


Notice  that  the  two  terms  in  equation  (P4l)  have  opposite 
sign.  Setting  the  derivative  to  zero,  and  choosing  the  value  of 
t  in  [0, 1],  we  get  t*  =  yT_,  where  p  is  as  defined  in  the 
statement  of  the  theorem.  The  optimal  value  of  the  function 
can  be  obtained,  by  substituting  t  =  t*  in  the  expression  for 

|(5°(9(i))+51(9(i)))-  ■ 

Theorem  3  (Optimization  of  average  decision  time):  For 
the  optimization  problem  in  equation  (fTTb  with  M  =  2,  the 
following  statements  hold: 

i)  If  7°,  I1  are  linearly  dependent,  then  the  solution  lies  at 
some  vertex  of  the  simplex  A„_i. 

ii)  If  7°  and  71  are  linearly  independent,  and  T  =  oo/°  + 
ail1,  ao,a\  £  R,  then  the  following  statements  hold: 

a)  If  On  and  o:-|  have  opposite  signs,  then  the  optimal 
solution  lies  at  some  edge  of  the  simplex  A„_i. 

b)  If  ao ,  cti  >  0,  then  the  optimal  solution  may  lie  in 
the  interior  of  the  simplex  A„_!. 

iii)  If  every  3x3  sub-matrix  of  the  matrix  [T  7°  71]  £ 

Rnx3  is  full  rank,  then  a  minimum  lies  at  an  edge  of 
the  simplex  An_i. 

Proof:  We  start  by  establishing  the  first  statement.  Since, 
7°  and  71  are  linearly  dependent,  there  exists  a  7  >  0  such 
that  7°  =  771.  For  7°  =  771,  we  have  g°  +  g1  =  (1  +  7 )g°. 
Hence,  the  minima  of  g°  +  g1  lies  at  the  same  point  where 
g°  achieves  the  minima.  From  Theorem  Q]  it  follows  that  g° 
achieves  the  minima  at  some  vertex  of  the  simplex  An_i. 

To  prove  the  second  statement,  we  note  that  from  Lemma  |9] 
it  follows  that  if  ao.  and  «i  have  opposite  signs,  then  the 
Jacobian  of  g°  +  g1  does  not  vanish  anywhere  on  the  simplex 
A„_i.  Hence,  the  minima  lies  at  the  boundary  of  the  simplex. 
Notice  that  the  boundary,  of  the  simplex  An_i,  are  n  simplices 
An_2 .  Notice  that  the  argument  repeats  till  n  >  2.  Hence,  the 
optima  lie  on  one  of  the  simplices  Ai,  which  are  the  edges 
of  the  original  simplex.  Moreover,  we  note  that  from  Lemma[h] 
it  follows  that  if  ao,ai  >  0,  then  we  can  not  guarantee  the 
number  of  optimal  set  of  sensors.  This  concludes  the  proof  of 
the  second  statement. 

To  prove  the  last  statement,  we  note  that  it  follows  im¬ 
mediately  from  Lemma  [TO]  that  a  solution  of  the  optimization 
problem  in  equation  (|2H  would  lie  at  some  simplex  Al5  which 
is  an  edge  of  the  original  simplex.  ■ 

Note  that,  we  have  shown  that,  for  M  =  2  and  a  generic 
set  of  sensors,  the  solution  of  the  optimization  problem  in 
equation  (PH  lies  at  an  edge  of  the  simplex  An_i.  The  optimal 
value  of  the  objective  function  on  a  given  edge  was  determined 
in  Lemma  [TU  Hence,  an  optimal  solution  of  this  problem  can 
be  determined  by  a  comparison  of  the  optimal  values  at  each 
edge. 

For  the  multiple  hypothesis  case,  we  have  determined 
the  time-optimal  number  of  the  sensors  to  be  observed  in 
Lemma  [10]  In  order  to  identify  these  sensors,  one  needs  to 
solve  the  optimization  problem  in  equation  (PH.  We  notice 
that  the  objective  function  in  this  optimization  problem  is  non- 
convex,  and  is  hard  to  tackle  analytically  for  M  >  2.  Interested 
reader  may  refer  to  some  efficient  iterative  algorithms  in 
linear-fractional  programming  literature  (e.g.,  ©)  to  solve 
these  problems. 
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VI.  NUMERICAL  EXAMPLES 

We  consider  four  sensors  connected  to  a  fusion  center. 
We  assume  that  the  sensors  take  binary  measurements.  The 
probabilities  of  their  measurement  being  zero,  under  two 
hypotheses,  and  their  processing  times  are  given  in  the  Table  Q] 


TABLE  I 

Conditional  probabilities  of  measurement  being  zero 


Sensor 

Probability(O) 

Processing  Time 

Hypothesis  0 

Hypothesis  1 

i 

0.4076 

0.5313 

0.6881 

2 

0.8200 

0.3251 

3.1960 

3 

0.7184 

0.1056 

5.3086 

4 

0.9686 

0.6110 

6.5445 

We  performed  Monte-Carlo  simulations  with  the  mis- 
detection  and  false-alarm  probabilities  fixed  at  10-3,  and 
computed  expected  decision  times.  In  Table  [III  the  numerical 
expected  decision  times  are  compared  with  the  decision  times 
obtained  analytically  in  equation  (O.  The  difference  in  the 
numerical  and  the  analytical  decision  times  is  explained  by 
the  Wald’s  asymptotic  approximations. 

TABLE  II 

Decision  times  for  various  sensor  selection  probabilities 


Sensor  selection 
probability 

Expected  Decision  Time 

Hypothesis  0 

Hypothesis  1 

Analytical 

Numerical 

Analytical 

Numerical 

[1.0, 0,0] 

154.39 

156.96 

152.82 

157.26 

[0.3768,0,0,0.6232] 

124.35 

129.36 

66.99 

76.24 

[0.25,0.25,0.25,0.25] 

55.04 

59.62 

50.43 

55.73 

In  the  Table  EH  we  compare  optimal  policies  in  each 
Scenario  I,  II,  and  III  with  the  policy  when  each  sensor 
is  chosen  uniformly.  It  is  observed  that  the  optimal  policy 
improves  the  expected  decision  time  significantly  over  the 
uniform  policy. 


TABLE  in 

Comparison  of  the  uniform  and  optimal  policies 


Scenario 

Uniform  policy 

Optimal  policy 

Objective 

function 

Optimal  probability 
vector 

Optimal  objective 
function 

i 

59.62 

[0,0, 1,0] 

41.48 

ii 

55.73 

[0,0, 1,0] 

45.24 

hi 

57.68 

[0,0.4788,0.5212,0] 

43.22 

We  performed  another  set  of  simulations  for  the  multi¬ 
hypothesis  case.  We  considered  a  ternary  detection  problem, 
where  the  underlying  signal  x  =  0,1,2  needs  to  be  detected 
from  the  available  noisy  data.  We  considered  a  set  of  four 
sensors  and  their  conditional  probability  distribution  is  given 
in  Tables  [IV]  and  [V]  The  processing  time  of  the  sensors  were 
chosen  to  be  the  same  as  in  Table  Q] 

The  set  of  optimal  sensors  were  determined  for  this  set 
of  data.  Monte-Carlo  simulations  were  performed  with  the 
thresholds  rjk,  k  £  {0, . . . ,  M  —  1}  set  at  10-6.  A  comparison 
of  the  uniform  sensor  selection  policy  and  an  optimal  sensor 


TABLE  IV 

Conditional  probabilities  of  measurement  being  zero 


Sensor 

Probability(O) 

Hypothesis  0 

Hypothesis  1 

Hypothesis  2 

i 

0.4218 

0.2106 

0.2769 

2 

0.9157 

0.0415 

0.3025 

3 

0.7922 

0.1814 

0.0971 

4 

0.9595 

0.0193 

0.0061 

TABLE  V 

Conditional  probabilities  of  measurement  being  one 


Sensor 

Probability(l) 

Hypothesis  0 

Hypothesis  1 

Hypothesis  2 

i 

0.1991 

0.6787 

0.2207 

2 

0.0813 

0.7577 

0.0462 

3 

0.0313 

0.7431 

0.0449 

4 

0.0027 

0.5884 

0.1705 

selection  policy  is  presented  in  Table  I VII  Again,  the  significant 
difference  between  the  average  decision  time  in  the  uniform 
and  the  optimal  policy  is  evident. 


TABLE  VI 

Comparisons  of  the  uniform  and  an  optimal  policy 


Policy 

Selection  Probability 

Average  Decision  Time 

9* 

Analytical 

Numerical 

Optimal 

[0,  0.9876,  0,  0.0124] 

38.57 

42.76 

Uniform 

[0.25,  0.25,  0.25,  0.25] 

54.72 

54.14 

We  note  that  the  optimal  results,  we  obtained,  may  only 
be  sub-optimal  because  of  the  asymptotic  approximations 
in  equations  (|3}  and  ©.  We  further  note  that,  for  small 
error  probabilities  and  large  sample  sizes,  these  asymptotic 
approximations  yield  fairly  accurate  results  a,  and  in  fact, 
this  is  the  regime  in  which  it  is  of  interest  to  minimize  the 
expected  decision  time.  Therefore,  for  all  practical  purposes 
the  obtained  optimal  scheme  is  very  close  to  the  actual  optimal 
scheme. 

VII.  CONCLUSIONS 

In  this  paper,  we  considered  the  problem  of  sequential 
decision  making.  We  developed  versions  SPRT  and  MSPRT 
where  the  sensor  switches  at  each  observation.  We  used  these 
sequential  procedures  to  decide  reliably.  We  found  out  the  set 
of  optimal  sensors  to  decide  in  minimum  time.  For  the  binary 
hypothesis  case,  three  performance  metrics  were  considered 
and  it  was  found  that  for  a  generic  set  of  sensors  at  most 
two  sensors  are  optimal.  Further,  it  was  shown  that  for  M 
underlying  hypotheses,  and  a  generic  set  of  sensors,  an  optimal 
policy  requires  at  most  M  sensors  to  be  observed.  A  procedure 
for  identification  of  the  optimal  sensor  was  developed.  In  the 
binary  hypothesis  case,  the  computational  complexity  of  the 
procedure  for  the  three  scenarios,  namely,  the  conditioned 
decision  time,  the  worst  case  decision  time,  and  the  average 
decision  time,  was  0(n),  0(n2),  and  0(n2),  respectively. 

Many  further  extensions  to  the  results  presented  here  are 
possible.  First,  the  time-optimal  scheme  may  not  be  energy 
optimal.  In  particular,  the  time  optimal  set  of  sensors  may  be 
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the  most  distant  sensors  from  the  fusion  center.  Given  that  the 
power  to  transmit  the  signal  to  the  fusion  center  is  proportional 
to  the  distance  from  the  fusion  center,  the  time-optimal  scheme 
is  no  where  close  to  the  energy  optimal  scheme.  This  trade  off 
can  be  taken  care  of  by  adding  a  term  proportional  to  distance 
in  the  objective  function. 

When  we  choose  only  one  or  two  sensors  every  time,  issues 
of  robustness  do  arise.  In  case  of  sensor  failure,  we  need  to 
determine  the  next  best  sensor  to  switch.  A  list  of  sensors, 
with  increasing  decision  time,  could  be  prepared  beforehand 
and  in  case  of  sensor  failure,  the  fusion  center  should  switch 
to  the  next  best  set  of  sensors. 
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